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The N-terminal domain of the nucleocapsid protein from Middle East 
respiratory syndrome coronavirus (MERS-CoV NP-NTD) contains many 
positively charged residues and has been identified to be responsible for RNA 
binding during ribonucleocapsid formation by the virus. In this study, the 
crystallization and crystallographic analysis of MERS-CoV NP-NTD (amino 
acids 39-165), with a molecular weight of 14.7 kDa, are reported. MERS-CoV 
NP-NTD was crystallized at 293 K using PEG 3350 as a precipitant and a 94.5% 

o 

complete native data set was collected from a cooled crystal at 77 K to 2.63 A 
resolution with an overall R merge of 9.6%. The crystals were monoclinic and 
belonged to space group P2 U with unit-cell parameters a = 35.60, b = 109.64, 
c = 91.99 A, = 101.22°. The asymmetric unit contained four MERS-CoV NP- 
NTD molecules. 


1. Introduction 

Between 2003 and 2004, Severe acute respiratory syndrome 
coronavirus (SARS-CoV) caused a worldwide epidemic and 
had a significant economic impact in countries affected by 
the outbreak (Lai, 2003). In 2004, another alphacoronavirus, 
Human coronavirus NL63 (HCoV-NL63), was isolated from 
a seven-month-old child in the Netherlands suffering from 
bronchiolitis and conjunctivitis (Pyre et al. , 2004). In 2005, 
Woo and coworkers discovered the novel betacoronavirus 
Human coronavirus HKU1 in patients with respiratory-tract 
infections (Woo et al , 2005). Recently, Middle East respiratory 
syndrome coronavirus (MERS-CoV) has been found in 
patients with severe acute respiratory-tract infections in the 
Middle East (Woo et al , 2014). As is the case for all corona- 
viral infections, there are no efficacious therapies currently 
available against coronaviral diseases, making the develop¬ 
ment of anticoronaviral compounds a priority. The virion 
envelope surrounding the nucleocapsid contains the following 
structural proteins: S (spike), M (matrix), E (envelope) and N 
(nucleocapsid). Some of them have a third glycoprotein, HE 
(haemagglutinin-esterase), which is present in most alpha- 
coronaviruses. The primary function of the HCoV N protein is 
to recognize a stretch of RNA that serves as a packaging signal 
and leads to the formation of the ribonucleoprotein (RNP) 
complex during assembly (Chang et al , 2014). The RNP may 
be important in keeping the RNA in an ordered conformation 
suitable for replication and transcription of the viral genome 
(Lai, 2003; Nelson et al , 2000; Huang et al , 2004; Navas- 
Martin & Weiss, 2004). In addition, the N protein has been 
© 2015 international Union of Crystallography identified as an important diagnostic marker and the most 
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Table 1 

Macromolecule-production information. 


Source organism 
DNA source 


Forward primer 

Reverse primer 

Cloning vector 
Expression vector 
Expression host 
Complete amino-acid sequence 
of the construct produced 


Middle East respiratory syndrome corona- 
virus 

The templates for the MERS-CoV N 
protein were provided by AllBio Science 
Incorporation, Taichung, Taiwan. The 
primers for the MERS-CoV N protein 
were provided by Genomics BioSci & 
Tech Ltd, New Taipei City, Taiwan. 
CTTATCGCATATGAACACCGTGAGCTGGTATACC- 
GGC 

CTTACGG CTCGAG GGTGCCTTCAATATGAAAGTT- 

TTTCG 

pET-28a (Novagen) 
pET-28a (Novagen) 

Escherichia coli BL21 (DE3) pLysS 
HHHHHHSSGLVPRGSHMNTVSWYTGLTQHGKVPL- 
TFPPGQGVPLNANSTPAQNAGYWRRQDRKINT- 
GNGIKQLAPRWYFYYTGTGPEAALPFRAVKDG- 
IVWVHEDGATDAPSTFGTRNPNNDSAIVTQFA- 
PGTKLPKNFHIEGT 


‘immunodominant’ antigen in infected hosts (Chan et al, 2005; 
Woo et al , 2004; Liang et al , 2013). 

The N protein of MERS-CoV, with a molecular weight of 
45.6 kDa and a pi of 10.05, is highly basic and hydrophilic 
(Woo et al , 2014). Previous studies revealed that the 
N-terminal domain of CoV (NP-NTD) contains mostly posi¬ 
tively charged residues, which are responsible for RNA 
binding, while the C-terminal domain (NP-CTD) mainly acts 
as an oligomerization module to form a capsid (Huang et al , 
2009; Saikatendu et al , 2007; Lo et al , 2013; Chen et al , 2013). 
The central disordered region of the N protein has also been 
shown to contain an RNA-binding region (Chang et al , 2009). 
We have shown that compounds that bind to the NP-NTD and 
interfere with NP-RNA interactions provide valuable leads 
for the development of anti-coronaviral therapeutics (Lin 
et al , 2014). The crystal structures of several NP-NTDs, 


M 1 2 



Figure 1 

SDS-PAGE analysis of MERS-CoV NP-NTD stained with Coomassie 
Brilliant Blue. Lane M, protein molecular-mass marker (labelled in kDa); 
lane 1, concentrated MERS-CoV NP-NTD after dialysis; lane 2, purified 
MERS-CoV NP-NTD. 


including those from SARS-CoV, Infectious bronchitis virus 
(IBV), Human coronavirus OC43 (HCoV-OC43) and Mouse 
hepatitis virus (MHV), have been described (Chen et al , 2013; 
Ma et al , 2010; Saikatendu et al , 2007; Jayaram et al , 2006; Yu 
et al , 2006; Fan et al , 2005). In order to clarify the mechanism 
by which the N protein of MERS-CoV bind to nucleic acids, 
we have undertaken the determination of the crystal structure 
of the N-terminal domain of MERS-CoV spanning residues 
39-165, which shares 58% identity to NP-NTD of SARS-CoV. 
The results presented in this paper mainly concern the crys¬ 
tallization and preliminary X-ray structural analysis of MERS- 
CoV NP-NTD. 


2. Materials and methods 

2.1. Macromolecule production 

The templates for the MERS-CoV N protein were 
purchased from AllBio Science Incorporated, Taichung, 
Taiwan. Truncated forms of recombinant MERS-CoV NP- 
NTD were generated by polymerase chain reaction (PCR) of a 
pGENT plasmid encoding the N-protein gene using different 
primers. The PCR products were digested with Ndel and 
Xhol, and the DNA fragments were cloned into pET-28a 
(Novagen) using T4 ligase (NEB). Expression of the protein 
was initiated by adding IPTG to a final concentration of 1 m M 
followed by incubation at 10° C for 24 h. After harvesting the 
bacteria by centrifugation (8000 rev min - , 12 min, 4°C), the 
bacterial pellets were resuspended in lysis buffer (50 m M Tris- 
HC1, 150 m M NaCl, 15 m M imidazole, 1 m M PMSF pH 7.5) 
and lysed by sonication on ice using 3 s pulses with 6 s pauses 
for a total of 10 min. Soluble proteins were obtained from the 
supernatant after centrifugation (13 000 rev min - , 40 min, 
4°C). NP-NTD carrying an N-terminal His 6 tag fused to a 
SSGLVPRGSHM linker sequence was purified using an Ni- 
NTA column (Novagen) and eluted with a buffered imidazole 
gradient of 15-250 m M. Fractions containing pure protein 



Figure 2 

Crystals of MERS-CoV NP-NTD obtained by the sitting-drop vapour- 
diffusion method. The largest crystals in the needle clusters are 
approximately 300 x 20 x 10 pm in size. 
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were collected at ~150 m M imidazole and were dialyzed 
against 50 m M Tris-buffered solution at pH 7.5 containing 
150 m M NaCl for 3 h at 4°C (Fig. 1). The purified NP-NTD 
was further concentrated using Amicon Ultra centrifugal filter 
units and centrifuged at 3500g for 10 min at 4°C several times 

_t 

until the concentration of NP-NTD reached 10 mg ml as 
determined by the Bradford method (BioShop Canada Inc.). 
Macromolecule-production information is summarized in 
Table 1. 

2.2. Crystallization 

Initial crystallization conditions were identified by using the 
sitting-drop vapour-diffusion method with crystal screening 
kits from Molecular Dimensions as described previously (Till 
et al , 2013; Chen et al , 2014). Each of the solutions (2 pi) from 
the crystal screening kits was mixed with 2 pi purified protein 
solution (10 mg ml - ) and allowed to equilibrate against 
300 pi solution in the well at room temperature (^25°C). The 
conditions were refined and crystals were grown from a well 
solution using the sitting-drop vapour-diffusion method by 
equilibrating a mixture of 2 pi protein solution (10 mg ml - ) 
and 2 pi reservoir solution against 300 pi reservoir solution 
consisting of 2 m M NaBr, 75 m M ammonium sulfate, 29% 
PEG 3350 (Sigma). Crystals appeared within two weeks and 
the largest crystal in the needle clusters grew to dimensions of 
approximately 300 x 20 x 10 pm (Fig. 2). Crystallization 
information is summarized in Table 2. 

2.3. Data collection and processing 

X-ray data were collected using synchrotron radiation with 
a crystal-to-detector distance of 350 mm. The oscillation width 
and exposure time for each frame were 1° and 20 s, respec¬ 
tively. Crystallographic data integration and reduction were 


Table 2 
Crystallization. 


Method 

Vapour diffusion 

Plate type 

24-well sitting-drop plate (Hampton 
Research) 

Temperature (K) 

293 

Protein concentration (mg ml -1 ) 

10 

Buffer composition of protein 
solution 

50 m M Tris-HCl pH 7.5, 75 m M NaCl 

Composition of reservoir solution 

2 mM NaBr, 75 m M ammonium sulfate, 
29% PEG 3350 

Volume and ratio of drop 

1:1; 2 pi reservoir solution was mixed 
with 2 pi purified protein solution 

Volume of reservoir (|il) 

300 


performed with the HKL -2000 program package (Otwinowski 
& Minor, 1997). The crystallographic data-collection statistics 
for NP-NTD are listed in Table 3. 

3. Results and discussion 

The MERS-CoV NP-NTD crystal chosen for this study 

o _ 

diffracted to 2.63 A resolution (Fig. 3) and belonged to space 
group P2 U with unit-cell parameters a = 35.60, b = 109.64, 
c = 91.99 A, = 101.22°. The Matthews coefficient of 
2.63 A 3 Da -1 calculated using MATTHEWS_COEF from 
CCP4 (Winn et al , 2011; Matthews, 1968) suggested that there 
were four molecules in an asymmetric unit with a solvent 
content of 59.2%. A homology search for the MERS-CoV 
NP-NTD structure was performed using the BLAST server 
(http://blast.ncbi.nlm.nih.gov/Blast.cgi). The model-selection 
criterion was based on the E-value and the estimated precision 
value. The sequence-alignment search indicated that MERS- 
CoV NP-NTD shares high identity with other NP-NTDs from 
coronaviruses (Supplementary Fig. SI). For example, the 
MERS-CoV NP-NTD (residues 39-165) has 58% sequence 



Figure 3 

Typical X-ray diffraction pattern of MERS-CoV NP-NTD. 
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Table 3 

Data collection and processing. 


Values in parentheses are for the outer shell. 


Diffraction source 

BL13B1 beamline, National Synchrotron 
Radiation Research Center (NSRRC), 
Hsinchu, Taiwan 

Wavelength (A) 

1.000 

Temperature (K) 

77 

Detector 

ADSC Q315r 

Crystal-to-detector distance (mm) 

350 

Rotation range per image (°) 

1 

Total rotation range (°) 

360 

Exposure time per image (s) 

20 

Space group 

P2.i 

a, b, c (A) 

35.60, 109.64, 91.99 

a, F y (°) 

90, 101.22, 90 

Mosaicity (°) 

1.053 

Resolution range (A) 

30-2.63 (2.73-2.63) 

Total No. of reflections 

138439 

No. of unique reflections 

13974 

Completeness (%) 

94.5 (91.4) 

Multiplicity 

7.1 (5.9) 

( 7 / 0 ( 7 )) 

18.89 (3.03) 


0.01 (0.08) 

Overall B factor from Wilson 
plot (A 2 ) 

43.1 


identity to SARS-CoV NP-NTD. The NTD from SARS-CoV 
(PDB entry 2ofz; Saikatendu et al., 2007) was chosen as the 
initial model as its E-value was 1 x 10 -23 , and a total of 116 
residues were modelled. The first molecular-replacement trial 
was performed using the PE RON automated interface at the 
Protein Tectonics Platform (PTP), RIKEN SPring-8 Center, 
Japan (Sugahara et al ., 2008). The core of the model consisted 
of a tightly packed /3-sheet surrounded by large loops. The 
molecular-replacement method was then applied to the model 
using MOLREP (Vagin & Teplyakov, 2010) using reflections 

o 

in the resolution range 8.5-3.0 A. Single and unambiguous 
solutions for four NP-NTD molecules in one asymmetric unit 
were obtained in the rotation and translation functions, 
yielding a final correlation coefficient of 0.81 and an R factor 
of 0.32. Structural refinement of MERS-CoV NP-NTD is 
currently in progress. 
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